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Abstract 

A stochastic evolutionary dynamics of two strategies given by 2 x 2 matrix games 
is studied in finite populations. We focus on stochastic properties of fixation: how 
a strategy represented by a single individual wins over the entire population. The 
process is discussed in the framework of a random walk with arbitrary hopping rates. 
The time of fixation is found to be identical for both strategies in any particular 
game. The asymptotic behavior of the fixation time and fixation probabilities in the 
large population size limit is also discussed. We show that fixation is fast when there 
is at least one pure evolutionary stable strategy (ESS) in the infinite population size 
limit, while fixation is slow when the ESS is the coexistence of the two strategies. 



1 Introduction 



Although the notion of evolutiona ry stable strategy (ESS) was worked out for 
finite populations many years ago ( Mavnard Smith . 19881 : Schaffer . 19881 : Neil 
2004), and there are numerous studi es where finiteness and spa tial structure of 
populations are taken into account ( Nowak fc Sigmund . 120041 ) (and references 
therein), models of evoluti onary game dynamics are ge nerally considered in 
populations of infinite size ( Hofbauer k, Sigmund . 20031 ) . To st udy evolution- 



ary game dynamics in finite populations Nowak and coworkers ([Nowak et al 



2004t iTavlor. C. et all 120041 ) generalized Moran's classical population genetic 



model ( Moranl . 1962| ) bv using frequency dependent fitness. While for infinite 
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populations a strategy is define d to be an ESS i f its fi tness is greater than 



the fitness of any rare mutant ([Maxnard Smith! . 119741 ). these recent works 
emphasize that an ESS strategy have to satisfy two requirements in finite 
populations: a) it is resistant against the invasion of rare mutants and b) the 
probability that a rare muta nt strategy overtake s the resident one is sm aller 



than it is for random drift ( Nowak et all 12004 iTavlor. C. et all l2004f ). In 



order to investigate these conditions quantitatively, Nowak determined the 
invasion coefficients (a)) and the fixation probabilities (b)) in the simplest 
2x2 matrix games in the above cited works. They conclude that apart from 
the payoff matrix, the population size plays an equally important role in the 
evolutionary dynamics of finite populations. Also, by proving a set of basic 
theorems, these authors establish a simple classification of evolutionary sce- 
narios in the se games. Alternative definitions of ESS in fi nite populations are 
discussed in f)SchafferL Il98fit IWild fc Tavlor. P. DlEool . 



On the other hand, obviously, the characteristic timescales also play a crucial 
role in these systems. In some situations, although fixa tion is probable, it might 
occur very slowly or rapidly ( Lieberman et al. . 2005| ). For systems with very 
slow fixation, the population typically consists of coexisting strategies, as it 
takes a long time for a strategy to win over the entire population. Conversely, 
systems with short fixation times are often found to be fixated to the preferred 
strategy. Moreover, it is possible in principle that the two strategies, when 
represented by a single mutant individual, fixate with the same probability 
but at different speeds. In this situation the probabilities of fixation would be 
insufficient to describe the system. We show in this paper, however, that this 
latter situation never occurs. In order to show this, we focus on the average 
fixation times of s i ngle m utants in the generalized Moran process introduced in 



f)Tavlor. C. et all 12004). This model is essentially a special case of a random 
walk with arbitrary hopping rates and two absorbing states. Some of our 
results are obtained for this general random walk. 



The paper is organized as follows. After recalling the definition and some basic 
properties of the model in Sec. 2, the exact expressions for the average time 
to overtake the entire population are derived for both strategies in Sec. 3. We 
show that a single mutant following strategy A fixates in the same average time 
as a single B player does in a given game, although the probability of fixation 
for the two strategies are generally different. An alternative direct proof of this 
property is presented in the Appendix. In Sec. 3.1 the neutral game of identical 
strategies is considered as the simplest example. In Section 4 the asymptotic 
behavior of the fixation time and the fixation probabilities are derived in the 
large population size limit. In Sec. 5 we formulate three conjectures making 
connections between the speed of fixation and other characteristics of the game 
dynamics, based on numerical investigations of the derived formulas. Finally, 
the results are summarized and discussed in Sec. 6. 
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2 Model and fixation probabilities 



In the evolutionary game introduced in ([Tavlor. C. et all 120041 ) . the individ- 
uals can follow either A or B strategies defined by the payoff matrix 





A 


B 


A 


a 


b 


B 


c 


d 



This matrix lists the average rewards of players after playing numerous times 
with each other. Namely an A player receives an average reward a when playing 
against another A player, and reward b when playing against a B player. 
Similarly, a B player gets reward c when playing against an A player and 
reward d when playing against another B player. 

Out of the total A" individuals, i of them follow strategy A, and N — i follow 
strategy B. As each individual plays with every other one, the average fitness 
of an individual playing strategy A or B are proportional respectively to 

fi = a(i-l) + b{N-i) 

9i = d + d(N - i - 1) . { ' 

More precisely these fitness values should be normalized by AT — 1, but that can 
be safely ignored as only their ratio appears under the present dynamics. Then 
the evolution of the strategies are governed by the following Moran process 
f Moran , 1962I ). and the fitness values are recalculated after each evolutionary 
step. At each time step an individual is chosen for reproduction proportional 
to its fitness. This newborn individual replaces another randomly selected 
individual, hence the total number of the population A" remains constant. 
Consequently, at each time step, i can change at most by one according to the 
following transition probabilities 



P(i 
P(i 
P(i 



z + 1) 



Mi 
A,- , 



(3) 



where 



A; 



(N - i)Qi % 
ifi + (N — i) 9i N 

ifi N-i 
ifi + (N-i) gi N 



(4) 



Note that the ratio of the hopping probabilities is equal to the ratio of the 
fitnesses /ij/Aj = fa. This Markov chain is essentially a random walk on 
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sites < i < N. A walk starting at site i eventually hits one of the absorbing 
states at or N. We denote the probability of hitting site first by and 
hitting site N first by ef = 1 — e,. Since in one time step the walker stays 
at its position or jumps to a neighboring site, these first passage probabilities 
obey the N — 1 equations 



— H>i£i-\ + (1 — H>% — + A,ej + i 

for 1 < i < N — 1, and the boundary conditions eo = = 1 and ejv = Cq 
The solution of these equations reads as 

Si.N-l + Soi-i 

€i = , ej = . 



Sq,N-1 



Sq,N-1 



where 



k=n 



««=nf i =nf > 

j=l A J j=l J 3 



9j 



(5) 
= 0. 

(6) 
(7) 



and q = 1 ( Karlin fc Tavlor. H. , 19751 Ivan Kamperl Il997 ) . We are particu- 
larly interested in the fixation probability for a single mutant B 



ejv-i 



Sq,n-i 



and that of a single mutant A 



Sq,n-i 



(9) 



The nature of these results are analyzed in the following sections. A single 
strategist A can invade B if its fitn ess is greater than the resident B-s, that 
is if fx > g± (jMaxnard Smithl . Il974f ). Since the probability of fixation isl/N 
in a neutral game, selectio n favors fixation of a s ingle mutant A (that is A 
to replace B) if ef > 1/N ((Tavlor. C. et al. . 2004| ). (Similar condition for the 
invasion of B is /at-i < 9n-i, an d for the fixation of B is ejv_x > 1/N.) Using 
these conditions it has been shown that there are five qualitatively different 
selection scenarios in finite populations for 2x2 matrix games. These different 
scenarios are listed in Table 1. 



The reason of writing B in front of A in the symbols of the different scenarios 
in Table 1 and throughout the paper is that in this way their order is the same 
as that of the pure B state (i = 0) and the pure A state (i = N) along the path 
of the walk. In this way the single arrows represent the preferred direction of 
the walk at each end of the chain. For example B^"^~ A means that in case of 
a single A player (i = 1) the walk prefers to step to the right (first arrow), 
but with a single B player (i = N — 1) it prefers to step to the left (second 
arrow). The double arrows point to the direction in which the absorption is 
more possible than in the neutral game. As an example B^^A means that 
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Class Selection favors 



Speed 







A to invade and replace B 


fast or 


slow- 


B Z 


Z A - 


mutual invasion, A and B to coexist 


slow 






^A: 


mutual invasion, A to replace B 


fast or 


slow 




Z A - 


no one to invade, A to replace B 


fast 








no one to invade or replace 


fast 





Table 1 

The possible combinations of qualitatively different fixation and invasion events. 
Changing the roles of A and B does not alter the scenarios qualitatively. The speed 
of the walk is discussed in Sec. 4.2, and 5. 

a system with a single A player (first arrow) and also a system with a single 
B player (second arrow) ends up in a pure A state more probably than in a 
neutral game. 



3 Average fixation time 



A walk starting at site i eventually gets absorbed into either site or site 
N. We are interested in the mean conditional exit (or absorption) time U 
to site 0, and tf to site N. The problem was formulated and sol ved for the 
simple random walk in ( Landauer fc Biittiker , 19871 : iFisher , 1988f ) in a more 
general framework. The solution for a rando m walk with arbitrary hopping 
probabilities is given in ((van Kampenl . [1997). For completeness we present 
here an elementary derivation using standard methods. 



Let Pi(t) be the probability that the walk gets absorbed into site at time t, if 
starts at site i at time 0. Obviously q = E^o Pi{t)> an d we use the convention 
Po(t) = 8 t) o, and Pjv(t) = 0. By definition, the conditional average exit time 
is 



E tPi(t) 



t=o 



E Pi{t) €i *=o 



(10) 



t=o 



For Pi(t) the following recursive equation holds 



Pi(t) = ^Pi_i(t - 1) + (i - m - Xi)Pi(t - 1) + XiP i+1 (t - r 



for 1 < % < N — 1. As we are interested in the first moment of Pi(t), we 
multiply both sides of this equation by t, and then sum it up from t = to 
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oo. We also apply the identity 

oo oo 

J2t p i(t - !) = E(* + WW = e ^ + !) > ( 12 ) 

t=0 t=0 

[where -Pj(t) = for t < 0] to arrive at 

e^j = ^ei_i(^_i + 1) + (1 — A*i — Ai)ei(ij + 1) + A;e m (t i+ i + 1) . (13) 
Using Eq. (5), and the notation Tj = erfi, the above equation simplifies to 

-6i = HiT^x - (ft + \)Ti + XiT i+1 , (14) 

which, in terms of <7j = r, — r i+ i, becomes 
Iterating this relation gives 

fc=i A ^fc 

Since <t = — ri, we know all for 1 < i < N — 1 as a function of Ti 

= Ti - °n ■ (17) 

n=l 

The value of T\ can be obtained from equation (14) with % — N — 1, noting 
that = 0, 

N-l n 

n = (l-ei)E&ET7. ( 18 ) 

n=l fc=l Afe " fc 

hence the conditional average time (t« = Tj/ej) to reach the origin is given by 

n = (1 - eOQjv - , (19) 
where we used the shorthand notation 

^ = E^ETT- (20) 

n=\ k=l Ak " k 

The conditional average time tf to reach the other absorbing state at site N 
can be obtained similarly through rf = eftf 

r+ = (1 - et)R ~ Ri , (21) 

with 



N-l N-l 



Ri = E Qn-i E 4*- ■ (22) 

n=i+l k=n Ak " k 
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Formula (21) can be also obtained directly from (19) by noting that to arrive 
from site % to the origin in a given chain is the same as to arrive from site 
N — i to site N in a mirrored chain, where all Xj are changed to fiN-j and all 
fij are changed to Ajv-j- 

The expressions for the average exit times are greatly simplified for the bio- 
logically relevant tn~i and . We can rewrite formula (19) as 



N-l 



t n-i — (1 — £n-i)Qn — Qn-i — Qn-i X! \ 

k=i x ^k 



£n-iQ 



N 



(23) 



and substituting expression (6) into we arrive at our final formula 



N-l 

. ■So^-l-Sn.V-l 

tN-l - 2^ 

• " N-l 



(24) 



Following the same steps for (21) leads to an expression for tf which is identical 
to (24), thus we call it the fixation time 



t &x (N) EE t N ^ = i+ 



(25) 



This surprising result states that the mean conditional exit times starting from 
one side and being absorbed into the other side of the chain does not depend on 
the particular side the walk starts, for arbitrary positive transition probabilities 
(it is obviously necessary that all ^ > and A; > 0). We also support this 
intriguing property of the walk by a simple argument in the Appendix. In the 
special case of random walks with constant bias (fj,i = 1 — Aj = const), the exit 
times are symmetric for all initial positions = if as shown in ((Rednerl . 
l200lh . Note also that for t he simp l e random walk (ui — A^ = 1/2) the formulas 
(19) and (21) simplify to (|Fisheri . Il98i iRednerl . Effllh 



U = -{2N-i) , n 



;{N 2 -l 2 ) . 



(26) 



3.1 Fixation time in the neutral game 



In the context of the game theory, the transition probabilities /ij and Aj depend 
on the fitness of individuals using a given strategy, as it is given by Eq. (3). 
A game is given by the payoff matrix (1), and the fitness of individuals using 
strategy A and strategy B are determined by Eqs. (2). The simplest game is 
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the neutral game given by the payoff matrix 





A 


B 


A 


f 


f 


B 


f 


f 



(27) 



and the corresponding transition probabilities are 

i(N-i) , s 

f>i = K = -^p 2 - (28) 

Hence the exit probabilities in a neutral game coincide with that of the simple 
(fair) random walk 

4 = ejv-i = ^ • (29) 

On the other hand, as \i\ + \ < 1, the conditional average exit times are 
different. Using Eq.(24), after some straightforward algebra we arrive at 

tfi x (iV) = =4 = N(N - 1) , (30) 

that is a neutral game fixates three times slower than a simple random walk 
(26) for large N. 

It seems natural to characterize the average "speed" of fixation in a non- 
neutral game by comparing the actual fixation time t^(N) to that in a neutral 
game N(N — 1). If tsx(N) > N(N — 1) then fixation is said to be slow, if 
tsx(N) < N(N — 1) then fixation is fast. It is easy to check that for an A^ = 2 
system tsx(2) = 2 independently of the elements of the payoff matrix. To write 
the fixation time and probability in a closed form for a general payoff matrix 
and for arbitrary A" seems to be a hopeless task, but they can be derived 
asymptotically in the large A" limit. 



4 Large A^ limit 



In this section we derive the large A^ asymptotic behavior of the exact ex- 
pressions for the fixation probabilities Eq.(8-9) and times Eq.(24). What large 
means depends on the actual payoff matrix; the speed of convergence can be 
slow for some particular choice of matrix elements (See Fig. 3 and 4). We 
emphasize that the large A^ behav ior o f a fin ite chain is different from the 
behavior in an infinite population ( Neill . 2004 ). modeled eit her by a walk on 



a sem i-infinite chain or by a replicator equation description (jTavlor. C. et al 
20041 ) . 
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Let us derive the asymptotic behavior of the basic quantity qk first 

9* = nf = expfi:inf) . (31) 

i=l J» \i=l /*/ 

When N is large, the above sum can be written as an integral 

„ T rv , , x(c - d) + d . . 

AT / dx In -4 ^ , 32 

Jo x(a-b)+b v ' 

where y = k/N. Evaluating the integral we arrive at qk = q{y) N , with 

m ,(l)"(^r±. (33) 

b ' f \ _|_ i,9- b ^ a ~ b 



The function g(y) is remarkably simple: it is either convex if a — c < b — d, or 
concave if a — c > b — d. At the boundaries 



1, ofl^lW. 



g(0) = 1 , d(l) = ) a/ . (34) 

a \ a-b 



Under the interchange of players (A ^ B, that is a ^=> d and 6 c) the 
function has the symmetry 

9x^3(2/) = _^ • (35) 



Since = q(y) N , the main contribution to the basic quantity so,iv-i comes 
from the g^ terms with ft/AT « y m ax, where q(y) takes its maximum at y max . 
According to the value of y max t here are four scenarios, in agreement with the 
replicator equation description ( Tavlor. C. et all 12004 ) 



(1) B — ► A: A dominates B, if a > c and b > d; y max = 1 

(2) B < — A: B dominates A, if a < c and b < d; y max = 

(3) B < > A: A and B are bi-stable, if a > c and b < d; y max = y* 

(4) B A: A and B coexist, if a < c and b > d; g(y) has minimum at y*, 
and its maximum is at y max = for 1 > q(l), while its maximum is at 
Z/max = 1 for 1 < g(l), 

where 

^* = ~T~5 — I ■ (*) 

a — c + a — b 

This suggests that the relevant parameters of the large A" behavior are a — c 
and b — d, hence we shell depict our results as a function of these variables in 
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Fig. 1 and 2. Again the arrows in symbols represent the preferred direction 
of the walk at each ends of the chain. B — ► A can be thought of as a short 
form of B A. 

Using the N — > oo limit in expression (33) we find that around the ends of 
the chain 

(£)* , for k < N , 

q(l) N (§) , for iV - fc < iV . 



(37) 



4-1 Fixation probabilities 



In order to calculate the fixation probabilities in the N —> oo limit, we need 
to know the asymptotic form of So,iv-i- Since qk = q^ , we only need the terms 
around the maximum of q(y). In the case B — > A (that is a > c and b > d), 
the maximum is at y = 0, hence, using Eq. (37) 



N-l 




so,n-i = £ 9* ~ E ( I J = [73rf • ( 38 ) 





This makes the fixation probability in the N — > oo limit 

et = — = l-t (39) 

So,JV-l o 

Surprisingly, as this result is valid as long as the maximum of q(y) is at y max = 
0, this is the fixation probability in the case of B A (that is a < c and 
b > d) when additionally q(l) < 1. This means that even when A and 5 
coexist, the fixation probability can be finite, even in the N — > oo limit. In 
this latter case however, as we shall see, the fixation time is exponentially large. 
In the above two cases the fixation probability for the other player becomes 
exponentially small 

*N-i = ^ ~ q(l) N , (40) 

So,JV-l 

where we used Eq. (37). While expression (39) for becomes exact as N — > 
oo, in the expression (40) for ejv-i, we do not know the coefficient of q(l) N 
exactly. In other words we only know that \n(e N _i)/N — > lng(l) as — > oo. 
This is comprehensible as we derived the leading order behavior in both cases. 

Similarly, for B < — A (that is a < c and b < d), and also for B A when 
q(l) > 1, the maximum of q(y) is at y max = 1, and repeating the same steps 
as before we arrive at 

et=(l-±)q(l) 1 - N ~Q(l)- N , ^ = l ~- c ( 41 ) 



10 



In the remaining region, B < > A (that is a > c and b < d), the main 

contribution to so,jv-i comes from the terms around y max = y*, where q(y) 
takes its maximum q* = q(y*) > 1. In the iV — > oo limit the sum So,jv-i can 
be replaced by an integral, which can be approximated by a Gaussian integral 
(steepest descent method) 



Nq* N jy y e W (-K^) ~ q* N \^N . (42) 



SOJV-l 



Thus, in this case both probabilities are exponentially small in the N — > oo 
limit 

4~-r^, e^-^f^Y . (43) 
1 y/Nq* N VN\q*J K ' 



To complete our investigation of the large N behavior of the absorption prob- 
abilities we focus now on the axes (see Fig. 1), that is when either a = c or 
b = d. When both are equal, then Qi/ fi ~ ?j ~ 1, and we asymptotically 
recover the behavior of the neutral game, ef ~ ejv-i ~ 1/N. When a = c and 
b > d, then the maximum of q(y) is at with non-zero derivative, thus the 
probabilities are the same as in the neighboring B — > A region and in the 
B A region for q(l) < 1. The same scenario occurs for b = d and a < c 
as it can be obtained by interchanging the players. 

Significantly different behavior is found on the other halves of the axes. For 
a = c and b < d the maximum of q(y) is at 1 but with zero derivative. Hence 
we should use again the steepest descent method to obtain s ,n-i ~ q(l) N y/~N 
which results in ~ q(l)~ N / \/N and e^r-i ~ l/y/N. The slowly decaying 

on this axes interplays between the fast decaying behavior in the B < >■ A 

region and the constant probability in the B — > A region. By interchanging 
the players it immediately follows that for b = d and a > c, ~ 1/ \fN and 
ejv _! ~ q(l) N /y/N. 

These results are summarized in Table 2 and depicted in Fig. 1. 



4-2 Fixation times 



To derive the large iV asymptotic we use the exact expression for the fixation 
time tfi x given by Eq. (24). Consider first the B — ► A case. Here, the main 
contribution to t Rx comes from the terms around y = and y — 1, where 1/Aj 
develops singularities. Indeed in the N — > oo limit, Aj of Eq.(4) can be written 

as 

\. — y^-y) (44) 



n 



CjV-1 — 1 — 7 



£n-i 



B 



1/N e$ ~ 1/v^ 



a — c 

- 7 7V 



1/VJV 



Fig. 1. Graphical illustration of the different scenarios for the fixation probabilities 
in the large iV limit as summarized in Table 2 (the value of 7 > can be read out 
from Table 2). ef and ejv_i are exponentially small where not specified. The zig-zag 
line in the B — A region refers to the boundary given by q(l) = 1, which is not 
a line in these variables. The probabilities in this region are equal to those in one 
of the neighboring regions. 

where f3{y = i/N) = gi/ fi, and it behaves asymptotically as 



A, 



d 1 . 

- - tor y < 1 

b y 

for 1 - y < 1 . 



(45) 



Let us also investigate the asymptotic behavior of the other terms in Eq. (24). 
As the main contribution to So,at-i comes from the first few terms, the ratio 
So, n -i/ s o,N-i — ► 1 as iV — > 00 for any finite y = n/N. In addition to that, 
close to the singularity at y = 



N-l 



qk 



E 4 

k= n y« 



1 d 

1+ 5 + U 



+ 



b-d 



(46) 



Thus, inserting the above asymptotic forms into Eq. (24), the terms around 
y = have a contribution 

s^Sr (47) 

where we do not specify the upper limit as the relevant contribution comes 
from the terms close to the lower limit 



y - ~iv 



dy- 



NlnN . 



(48) 



l/N 
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B 



£fix 



t fix ~ N In N 



B< — A 



b-d 

. AT 3 / 2 

^iV 2 



5 



t fix ~ iV In iV 



tfix ~ 



a — c 



t fix ~ iVlniV 



iV 3 / 2 



< > A 



Fig. 2. Graphical illustration of the different scenarios for the fixation time. The 
large N behavior of the fixation time discussed in Table 2 is displayed (the value 
of 7 > can be read out from Table 2). The thick lines refer to relevant finite size 
corrections to the N — ► oo behavior. 

By taking into account the terms close to the other singularity at y = 1 as 
well, where 



N-l 

S n ,N-l _ Qk_ _ ^ C 



c\ 2 

- + 

a) 



a — c 



(49) 



we arrive the large N asymptotic expression for the fixation time 

d 



tfix 



+ 



b-d. 



NlnN ~ NlnN . 



(50) 



According to numerical evaluation of Eq. (24), the A^ln A^ part is asymptoti- 
cally exact but the amplitude is probably just a very good approximation. 

Now, the B < — A case can be easily obtained by interchanging the players 
A ^ B in the previous case, and by noting that the fixation time is always 
the same for both players as stated in Eq.(25). Hence 



tfix 



c — a 



+ 



d * 
d-b 



NlnN ~ NlnN . 



(51) 



Next we consider the B A case for q(l) < I. The quantity which develops 
the relevant singularity in the A" — > oo limit is l/q n . It is indeed singular at y* 
of Eq. (36), since q n = q(y) N , with y = n/N, and as q{y) takes its minimum 
q* = q(y*) at y*. Similarly to the previous case, so, n -i/so,JV-i —> 1 as A" — > oo, 
for any finite y = n/N. The term s ni 7v_i gains the relevant contribution around 
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y = 1 as long as q(n/N) < q(l), which is the case around the singularity as 
q* < q(l)- This makes s nj Ar_i qN-i+QN-2+- • ' ~ 9(1)^- As we showed in the 
previous case, A n has only weak singularities at y = and y — 1, leading to an 
A/" In A/" contribution to t fix , which can be safely neglected in this case. Hence 
\ n can be considered a constant around y*. Putting now all these asymptotic 
expressions back to Eq. (24) we arrive at 




*h~L ~ vjv , (52) 



where, at the last step, we used the method of steepest descent again. The 
case B A case for q(l) > 1 is entirely similar, with the only difference 
that in the N — > oo limit s n) jv-i/so,iv-i - ^ 1) while s , n -i ^ b/(b — d) =const 
around the singularity y* of l/q n . Hence, in this case 

*fl*~vOv(l) . (53) 



In the remaining B < > A region we have three singularities in the expression 

of Eq. (24) for t fix at y — 0, y* and 1. The contribution coming from the terms 
close to y = and y = 1 is ~ N In N, analogously to the B — ► A case. We 
provide a crude argument that the contribution from the singularity at y* is 
smaller, namely ~ N. 

Let us approximate quantities in Eq. (24) around the singularity y* up to 
second order in 8 = y — y*. Around its maximum q(5) ~ q*(l — /35 2 ), with 
f3 > 0. This makes 

q n (5) » g^ e -^ 2 , (54) 

hence So,at-i ~ \^Nq* N , and in this approximation So,n*-i ~ s n*,7v-i ~ 
s ,at-i/2, where n* = iVy*. Integrating Eq.(54) for small 5 <C l/y/N/3, we 
arrive at 

s 0l „-i ~ VNq* N (l + /fl'v'MS + 0(<5 3 )) , (55) 

where > is a constant, and the same for s nj jv_i, with — /?' instead of (3'. 
Putting these asymptotic expressions into Eq. (24) and noting that A n can be 
considered as constant around y*, we get 

tfix ~ v^E 1 + ~ AT 3 / 2 / dS e^" NS " ~ AT , (56) 

where /3" > is another constant. Together with the contributions from y = 
and 1 the asymptotic behavior is 

U x ~N\nN. (57) 
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R ► A (a > r b > d) 

J-J 7 vj. \ Li L-j 1/ Li' J 


1 - d/b 


yl-U 


~ AT In TV 


r « 4 /n / r h <r rl\ 

±J ' ^ 1 1 Li \ L-j f \ Li J 


IK 1 ) 


\ — nlc 

± Li j L 


~ A7"ln V 

IV 111 J V 


R i > 4 f n > r h < d) 

1 J ~ 7 Ji \ Li s** L-j U \. Li J 


Ivy J 




~ TVln /V 

' ^ IV 111 1 V 


R A in < r b~> <f\ aiW < 1 


X Li / U 




~ (a(l)/o*) N 




y\ / 


1 - a/c 


iv y / 


a = c, b = d 


~ 1/TV 


~ 1/TV 


-TV 2 


(a = c, b > d) or (b = d, a < c) 




~ i/Vjv 


~ TV 3 / 2 


(a = c, b < d) or (b = d, a > c) 


~ l/Vrv 


- ?(!)" 


~ TV 3 / 2 



Table 2 



Fixation probabilities and times in the TV — > oo limit for different scenarios. Note, 
that due to the specific values of q(l) and g*, when the above quantities depend 
exponentially on TV, the fixation probabilities are exponentially small, while the 
fixation times are exponentially large. 

We complete this section as well with investigating the asymptotic behavior 
on the axes. In the origin (a = c and b = d) we asymptotically recover the 
results for the neutral game (Sec. 3.1). As <?j//j ~ ~ 1, s n , m ~ m — n and 
Aj ~ y(l — y) we obtain ig x ~ TV 2 . 

For a = c and b > d the maximum of g(y) is at with <f (0) < 0, hence 
similarly to the B — ► A case, the terms around y = have an TV In TV 
contribution. The relevant contribution though comes from the terms around 
y = 1 where q'(l) = 0. Around y — 1 the ratio So, n -i/so,JV-i ~ 1 and A n ~ 5 
with 5 = 1 — y = 1 — n/N . By approximating g n 9(1) (1 + I35 2 ) up to second 
order with f3 > 0, we obtain 

SfMV _i « ^(1)^(1 + ^5 2 ) . (58) 

Inserting these terms into Eq. (24) and using the steepest descent method we 
arrive at the algebraically decaying asymptotic behavior tg x ~ TV 3 / 2 . The same 
behavior occurs for b = d and a < c, as it follows by interchanging the players. 

The b = d and a > c case goes along similar lines. The relevant contribution in 
this case comes from the terms around y = where q(y) takes its maximum 
and <f (0) = 0. In the leading order \ n ss y, q n 1 — /3i/ 2 with /3 > 0, 
s ni Ar_i/s ,7v-i w 1, and 

So^^TV^l-^V) . (59) 

Using Eq. (24) we arrive at the same algebraic asymptotic behavior t fix ~ TV 3//2 . 
Interchanging the strategies leads to the same result also for a = c and b < d. 
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1 ' 1 1 ' 1 

20 40 60 80 100 

N 

Fig. 3. Illustrative examples for the relative fixation time as a function of population 
size when there is an ESS with coexisting strategies in the infinite population size 

limit, B -^<— A. The payoff matrix is I I , where parameter a varies as 

indicated in the figure. 

These results for the asymptotic behavior of the absorption times are summa- 
rized in Table 2 and also depicted in Fig. 2. 



5 Numerical examples 



In this section we investigate the population size TV dependence of the fixation 
time tfi x based on numerical evaluation of the exact formula Eq. (24). For large 
iV values the asymptotic forms summarized in Table 2 are recovered. Relevant 
deviations from these asymptotic forms are present close to the boundaries of 
the two types (exponential and N In N) of behavior, namely at the boundaries 
of the B >< A region, marked with thick lines in Fig. (2). 



As a typical example for large finite size effects we consider payoff matrices 
(a 4] 

of the form with a < 3, which then belong to the B A case. For 

V 3 V 

a = 1, one sees the exponentially growing fixation time in Fig. 3. As a — ► 3, 
that is as we approach the boundary a — c = in Fig. 2, the exponential growth 
can be observed only for larger population sizes. As it is shown in Fig. 3 for 
small population sizes, tfi x can even be smaller than iV(iV — 1), the fixation 
time for neutral games. For smaller c — a values the exponential growth of 
the fixation time starts at larger iV values. The same finite size behavior is 
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Fig. 4. Illustrative examples for the relative fixation time as a function of population 
size when there are only one ESS-s in the infinite population size limit, B — ► A. 

.52 

The payoff matrix is | ) , where parameter c varies as indicated in the figure. 

; 1 

observed as we approach the b — d = boundary from the same B A 
region. 

Finite size effects are also relevant in the B — > A region when approaching 
the a — c = boundary. In this case the asymptotically slow (~ NlnN) 
fixation time can exceed the neutral value N(N — 1) for small population sizes 
N. This behavior can be observed on an example in Fig. 4. Again, in general, 
for smaller a — c values the fixation time exceeds that of the neutral game 
up to larger values of N. The same scenario occurs in the B < — A region as 
d — b ^ 0. Conversely, finite size effects are not relevant at the boundaries of 

the B < ► A region. This fact is not surprising as the large N behavior of 

the fixation time is also ~ NlnN in the neighboring domains, and even on 
the boundaries it is still smaller (iV 3//2 ) than in the neutral game. 

Now we make three conjectures about the fixation time for finite N values. 
These conjectures are based on numerical evaluation of Eq. (24) for numerous 
(~ 50000) random (all elements are from the interval [0, 1]) payoff matrices in 
for each system sizes from N = 2 to N = 100. First conjecture: ta x (N) < 

N(N — 1) for all values of N in the B < > A region. The equality (tf^(N) = 

N(N — 1)) was only observed in the trivial N = 2 case. Second conjecture: 
if ejv-i > 1/N and > 1/N then t fix (iV) > N(N - 1). Third conjecture: 
if f\ - 9l < and - ^-i > then t &x (N) < N(N - 1). 

Based on the above conjectures we can predict the speed of the process in 



three out of the five evolutionary scenarios established in (jTavlor. C. et al 
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2004). According to our 2. conjecture if mutual invasion and mutual fixation is 



favored then fixation was experienced to be slow without exception (case 2. in 
Table 1). Thus a state with coexisting strategies is likely to b e observed in this 
situa tion. This is a qualitatively new situation not present in (Tayl or. C. et all 
2004 ) . If selection opposes mutual invasion then fixation is fast (3. conjecture), 



independently whether selection favors fixation or not (cases 4. 5. in Table 1.) 
This conjecture is also supported by the large iV asymptotics (see Fig. 2. 



6 Discussion 



We have used a recently introduced frequency dependent Moran process to 
study game dynamics in finite populations. The probability that a single mu- 
tant strategy A can fixate in a population consist i ng N — 1 individuals with 
strategy B has been k nown ( Karlin Sz Tavlor. H. . 19751 : Nowak et al. . 2004 : 



Tavlor. C. et alll2004h . The stationary distribution for this system is studied 



recently. It is shown nu merically that this distr i butio n can deviate significantly 
from the Gaussian one ( Claussen Sz Traulsen . 2005| ). We studied the average 
time to fixation in this paper. We have shown that the fixation time is iden- 
tical for the two strategies for a given game independently of the elements of 
the payoff matrix and the population size, that is ig x (AQ = tjv-i = tf. In fact 
this property is necessary for fixation probabilities to meaningfully character- 
ize selection scenarios. As an example consider a process where the fixation 
probabilities are the same for the two strategies (ejv_i — Then in order 
to conclude that t he population is observe d with the same probability in both 
absorbing states ( Fudenberg et al. . 20051 ) it is necessary to know that both 
absorption times are the same. 



Since the exact results for the fixation time and fixation probabilities are overly 
complicated as a function of the payoff matrix, we also determined their large 
N asymptotic behavior. We have shown that tfi x ~ NlnN if there is at least 
one pure ESS in the infinite population size limit. Thus, the fixation time 
becomes negligibly smaller than that of the neutral game as the population 
size goes to infinity (In N/(N — 1) — > if iV — > oo). Interestingly, fixation is 
fast even if the system is bistable (there are two ESS-s), although to arrive 
from the neighborhood of one of the stable states to the other one is impossible 
i n the standard deterministic replicator dynamics of the infinite populations 
(Hof bauer fc Sigmund . 20031 ). On the other hand we have shown that the 
fixation probability of a mutant is exponentially small in the large population 
size limit. A particularly surprising result is that, as the population size grows, 
the fixation probability of a dominating strategy tends to a constant less than 
one. This probability depends only on the relative payoff of the non-dominant 
strategy against itself to the payoff of the dominant strategy against the non 
dominant one (see Table 2). Consequently, in real stochastic populations (even 
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in very large ones) the probability of fixation of a dominant mutant can be very 
small when the mutant strategy receives only slightly bigger payoff against the 
resident strategy than the resident does against itself. One can gain an intuitive 
insight into this result. When the mutant is rare, it meets with the resident 
strategy much more frequently than with another rare mutants. If the mutant 
has only a small advantage in this state then it can extinct easily. This is why 
the fixation probability of the dominant mutant is small in this case. In the 
reverse case, if the mutant plays much more effectively against the resident 
strategy than the resident itself, then the rare mutant will spread with high 
probability, so their fixation probability is close to one. 

If strategies A and B coexist in the infinite size deterministic model then 
the fixation time increases exponentially with N, that is the fixation time 
tends to infinity compared to the neutral game ((e N / [Vn (N — 1)] — > oo). 
Hence in the large population limit A and B remain practically in a state 
of coexisting strategies. Depending on a complex relation among the payoff 
elements {q(l) ^ 1) one of the strategies fixates with a constant probability 
while the fixation probability of the other decreases exponentially with N 
(Table 2.). While fixation is slow in this case, it is probable for a rare mutant 
that plays effectively against the resident strategy (see above). 



Our results about the bistable case have some consequences in the context 
of the origin of cooperation. The origin of cooperative behavior amo ng selfish 
indiyiduals remains an intriguing problem of evolutionary biology (lAxelrodl. 



1981 



Mavnard Smith fc Szathmarv , 19951 : Szabo et al. , 2004 ; Nowak fc Sigmundl . 



2004 ). U s ing t he iterated pri soner's dilemma ga me as a conceptual model 



f)Axelrodl . I 198l| ) Nowak et al. ((Nowak et all l2004| ) considered a game where a 
cooperative (Tit-for-Tat) strategy plays against an always defective strategy 
(A11D). In this situation both the Tit-for-Tat and A11D strategies are stable 
fixed points of the classical replicator dynamics. Consequently, although a 
purely cooperative population is a stable state, Tit-for-Tat can not spread in 
a defective population. However, this argument is true only for infinite deter- 
ministic populations. Nowak et al. demonstrated, using the model studied in 
the present paper, that a single mutant cooperative Tit-for-Tat can replace 
the N — 1 A11D players with high probability, for appropriate A" values (not 
too small, not too large). This means in our context, that the fixation proba- 
bility of Tit-for-Tat could be four or five times greater than it would be in the 
neutral case (1/N). On the other hand, replacement of Tit-for-Tat by A11D is 
unlikely since fixation probability of A11D is smaller than 1/N for appropriate 
N-s. That is, selection follow the scenario (4) in Table 1, where Tit-for-Tat 
corresponds to strategy B. Nevertheless, the speed of the fixation of Tit-for- 
Tat remained an open question. According to our third conjecture and our 
analysis of the large A" limit, however, if selection opposes mutual invasion 
then the fixation is fast (cases 4 and 5 in Table 1). Consequently, if the pop- 
ulation size is appropriate for the fixation of the cooperative strategy, then 
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this fixation will be fast, making the spread and fixation of the cooperative 
strategy more likely in finite populations. 
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Appendix 

In Eq.(25) we arrived at the surprising result that the fixation times are always 
the same for both species. In this section we provide a simple independent 
argument to support this result. Say we have a particular walk s, which starts 
from site 1 at time 0, and arrives at site N for the first time at time T(s), 
without having stepped on site 0. It is obvious that this walk has to step 
through each bond to the right (towards site N) at least once. It is also clear 
that the walk steps one more times to the right then to the left through each 
bond, as eventually the walk ends up at site N. This means that the probability 
P(s) of the walk s can be written as 

N-2 N-1 N-1 

p(s) = xn-i n \- +c Wi n "f* = *oo n A * > ( 6 °) 

i=l i=l i=l 

with ccj and fa being non- negative integers, = 1 — ^ — \, and we used the 
notation 

*(*)= nW+ir lk* • (6i) 

i=l i=l 

Now for each walk s we can uniquely construct a walk s, which walks backward 
from site N — 1 to site on the path of s, that is s t = N — s T ( s )-i-t for 
< t < T(s), and s T ( s ) = N. The duration of the two walks are obviously 
the same T(s) = T(s). Since s steps through each bond backward compared 
to s, its probability can be obtained by interchanging all Aj by and vice 
versa, apart from the last steps of each walk, where we have to change A at to 
/ii, that is 

N-2 N-1 N-1 

p(s) = /xi n n ^ = n ^ ^ 

i=i i=i i=i 
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In order to calculate the average conditional first passage time tf we have to 
sum over all path s, which starts from site 1 and exits at site N, that is 

ET(s)P(s) ET(s)n(s) 

.+ _ W _ {S} 

E P(s) ~ E As) ■ {b6) 

W W 

Similarly, by summing over the walks which start at site N — 1 end exit at 
site we arrive at 

ET(s)P(s) ET( S )tt( S ) 

tN -' = ep(s) = ~^ur ■ (64) 

« {5} 

Since there is a one to one correspondence between {s} and {§}, the two sums 
are the same, that is tf = ijv-i, what we wanted to demonstrate. 

It is interesting to note that the same argument also applies to the conditional 
first passage probabilities 

E 5 T ( s ), T P(s) E 5t(s),tP(s) 

^'^ '"EfC) = ffl SPW = r) (65) 

{*} {s} 

that is not only the average first passage time, but the whole conditional first 
passage probability to arrive at site N from site 1 in time r is the same as 
that to arrive at site from site iV — 1 in time r. 
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